Big Data Curation

نویسنده

  • Renée J. Miller
چکیده

A new mode of inquiry, problem solving, and decision making has become pervasive in our society, consisting of applying computational, mathematical, and statistical models to infer actionable information from large quantities of data. This paradigm, often called Big Data Analytics or simply Big Data, requires new forms of data management to deal with the volume, variety, and velocity of Big Data. Many of these data management problems can be described as data curation. Data curation includes all the processes needed for principled and controlled data creation, maintenance, and management, together with the capacity to add value to data. In this talk, I describe our experience in curating some open data sets. I overview how we have adapted some of the traditional solutions for aligning data and creating semantics to account for (and take advantage of) Big Data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use of Lean Six Sigma methodology in digital curation

In this paper, we give an overview about the current research in Big Data and Digital Curation with a focus on Lean Six Sigma and discuss how this methodology can help the Digital Curation lifecycle. For instance, the application of the Lean Six Sigma methodology is presented and discussed with a special focus on the selection, preservation, maintenance, collection and archiving of digital info...

متن کامل

Big Sensing Data Curation in Cloud Data Center for Next Generation IoT and WSN

Modern sensing devices play a pivotal role in achieving data acquisition, communication and dissemination for the Internet-of-Things (IoT). Naturally, IoT applications and intelligent sensing systems supported by sensing devices, such as wireless sensor networks (WSN), are closely coupled. Modern intelligent sensing systems generate huge volumes of sensing data, well beyond the processing capab...

متن کامل

Big Data to Knowledge—Harnessing Semiotic Relationships of Data Quality and Skills in Genome Curation Work

This article aims to understand the views of genomics scientists with regard to the data quality assurances associated with semiotics and Data-Information-Knowledge (DIK). The resulting communication of signs generated from genomic curation work, was found within different semantic levels of DIK that correlate specific data quality dimensions with their respective skills. Syntactic DQ dimension...

متن کامل

Spatial Big Data: Platforms, Analytics, and Science

Emerging non-traditional spatial datasets from geo-social media, sensor networks, and volunteers are important due to societal applications such as situation assessment after natural disasters, monitoring urban traffic, etc. However, such datasets, called spatial big data, often exceed the capacity of commonly used spatial computing platforms. Spatial big data presents new challenges for their ...

متن کامل

Study of the foundation, models and issues of research data curation and management in scientific and academic environments

Background and Aim: The purpose of this paper is to study, identifying and discuss the foundation and concepts, models and frameworks, dimensions and challenges of research data curation and management in scientific and academic environments. Method: This article is a review article and library method was used to collect scientific and research texts in this field. In this research, external an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014